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Introduction. It is known that fires are one of the most large-scale emergencies. It is possible to systematize and 
formalize their causes only if you take into account the effective analysis of statistical data. The scientific problem lies 
in the lack of effective mathematical tools and techniques that allow the use of fire statistics as an emergency prevention 
tool. The solution of this problem is relevant for science and technology. Based on the identified problem, the purpose 
of this study is formulated, which consists in the analysis of fire statistics and its formalization in predicting 
emergencies. 

Problem Statement. The objective of this study is to analyze the state and causes of fires, as well as to find a tool for 
their prediction. 

Theoretical Part. The methodological tools for solving this problem are the use of multiple regression and correlation 
analysis methods that allow criticizing and formalizing the available fire statistics. It is established that an acceptable 
parameter characterizing the reliability and closeness of the connection of empirical data with their mathematical 
function in relation to the task is the correlation coefficient. 

Conclusions. It is proved that an effective tool for predicting fires is the use of linear regression analysis methods. The 
practical significance of the results obtained for science and technology lies in the possibility of creating digital tools for 
predicting and preventing emergencies, which will significantly reduce resource costs for eliminating their 


consequences. 
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Introduction. It is known that the occurrence of emergency situations is a multifactorial process that depends 
on a large number of components. One of the most frequently occurring categories of emergencies is fire. Fires, 
regardless of the object, lead to significant, and in some cases catastrophic, socio-economic and technological 
consequences. The work of the State Fire Service of the Ministry of Emergency Situations of Russia is aimed at 
preventing and minimizing such situations. 

Information about fires at industrial and civil facilities is published annually in open sources [1-4]. This 
Statistic is of a precautionary nature, since it allows us to assess the relationship between some of the factors leading to 
the occurrence of an emergency. Based on the analysis of sources [1—4], it was possible to identify the following factors 
leading to fires: violation of the rules of operation of electrical equipment and household electrical appliances, 
malfunction of production equipment, violation of the technological process of production, careless handling of fire, 
cases when children play with fire, violation of fire safety rules during electric and gas welding works, explosions, 
spontaneous combustion of substances and materials, malfunction and violation of the rules of operation of furnace 
heating, arson and other unidentified causes. However, despite the consistency of the information provided, there are no 
tools to formalize it and build a probabilistic forecast of the development of emergencies in the current conditions. 
These forecasts are necessary to indicate the significance of preventive measures with the presentation of indicative 
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socio-cultural losses. Based on the above, we believe that the analysis of fire statistics and its formalization is a 
practically significant, urgent task for both industry and the civilian population. 

The aim of the study is the analysis of fire statistics and its formalization in predicting emergencies. 

Problem Statement. Representative statistical data on the number of fires are the values published in [1-4]. 
These materials are reliable and have convergence with the results published in [5-7]. 

It is established that a promising tool for the formalization of statistical data is the use of multiple regression 
analysis methods, the essence of which can be summarized using mathematical coefficients that estimate the value of 
the reliability of the functional approximation of a certain data array. Based on the analysis of [8—10], it has been found 


that, in relation to the task, the most rational is the use of linear regression models of the form: 


kK k 
y = bo +) bx +). byxixy (1) 
i=1 i=1 


where bo, bj, bj, bj — coefficients characterizing the strength of the influence of free, linear, quadratic and paired 
interaction effects of the mathematical model; x; x; — factors the influence of which on the response under 
consideration is demonstrated by the mathematical model. 

We take the correlation coefficient as the parameter that most representatively characterizes the reliability and 
closeness of the connection of empirical data with their mathematical function. The practical experience of assessing the 
reliability of statistical models using the correlation coefficient is presented in [11, 12]. Let us consider the general 
regularities of the algorithm for using the correlation coefficient. Let us suppose there is a sample n of the values 
(X1, V1), (X25 V2),--+5 (Xp, Vy) With their joint distribution, then the correlation coefficient is determined as: 

a din %i = MY —y) . 
air — 2)" Per 
The correlation coefficient between x and y evaluates an empirical measure of the linear relationship between 


(2) 


them. Moreover, nx =}, x;, ny = yj. If we put multipliers /n-/) before all sums, then 7, will allow us to take into 
account variances and covariance with their sample estimates replaced. However, in relation to the task, we will limit 
ourselves to the representation of the numerical values of the correlation coefficient relative to —1 and +1, where the 
value —1 indicates the absence of a connection, and +1 indicates its presence. 

After choosing a mathematical tool for formalizing statistics, we will proceed to the analysis of empirical data 
necessary for the construction of probabilistic forecasts. Table 1 presents data on fires in the Russian Federation for 
2015-2020, published in [1]. 


Table 1 
Statistics of fires in the Russian Federation for 2015-2020 
Year / 
Emergency 1 2 3 4 ey 6 7 8 9 10 11 
factor 
2015 10228 | 25456 338 29243 | 1414 618 54 305 13502 | 1184 9376 
2016 9034 | 25118 300 25828 | 1279 552 50 287 13683 1369 8761 
2017 8296 | 24995 318 24255 | 1100 549 40 273 12912 | 1450 8056 
2018 7698 | 25868 351 22668 | 1080 531 47 298 14087 | 1378 8009 
2019 8814 | 25360 327 25498 | 1218 563 47 291 13546 | 1345 8551 
2020 8296 | 24995 318 24255 | 1100 549 40 273 12912 | 1450 8056 
Note to the table: the emergency occurrence factor: 1 — violation of the rules of operation of electrical 


equipment and household electrical appliances; 2 — arson; 3 — malfunction of production equipment, violation of the 
technological process of production; 4 — careless handling of fire; 5 —children play with fire; 6 — violation of fire 
safety rules during electric and gas welding works; 7 — explosions; 8 — spontaneous combustion of substances and 
materials; 9 — malfunction and violation of the rules of operation of furnace heating; 10 — unidentified; 11 — other 
causes. 


Theoretical Part. Figure | shows the distribution of the number of fires depending on the year. 
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Fig. 1. Distribution of the number of fires depending on the year 


The analysis of the presented statistics allowed us to establish that: 

— linear model (3) characterizes with a sufficient degree of reliability the annual distribution of the number of 
fires from violations of the rules of operation of electrical equipment and household electrical appliances at the level of 
Statistical significance a = 0.05: 

y =—598.00x + 1 216 256.00, (3) 
while the correlation coefficient is r= 1.0; 

— linear model (4) characterizes with a sufficient degree of reliability the annual distribution of the number of 
fires from other causes at the level of statistical significance a = 0.05: 

y =-47x + 102 996, (4) 
while the correlation coefficient is r = 1.0; 

— linear model (5) characterizes with a sufficient degree of reliability the annual distribution of the number of 
fires caused by children’s play with fire at the level of statistical significance o = 0.05: 

y =—20x + 41 500, (5) 
while the correlation coefficient is r= 1.0; 

— linear model (6) characterizes with a sufficient degree of reliability the annual distribution of the number of 
fires from malfunction of production equipment, violation of the technological process of production at the level of 
statistical significance a = 0.05: 

y = 33x — 66 342, (6) 
while the correlation coefficient is r= 1.0; 

— linear model (7) characterizes with a sufficient degree of reliability the annual distribution of the number of 
fires from careless handling of fire at the level of statistical significance a = 0.05: 

y =—1 587.00x + 3 229 995,00, (7) 
while the correlation coefficient is r = 1.0; 

— linear model (8) characterizes with a sufficient degree of reliability the annual distribution of the number of 
fires from violations of fire safety rules during electric and gas welding operations at the level of statistical significance 
a = 0.05: 

y =—18x + 36 909, (8) 
while the correlation coefficient is r = 1.0; 

— linear model (9) characterizes with a sufficient degree of reliability the annual distribution of the number of 
fires during explosions at the level of statistical significance a = 0.05: 

y = 7x— 14 100, (9) 
while the correlation coefficient is r = 1.0; 

— linear model (10) characterizes with a sufficient degree of reliability the annual distribution of the number of 
spontaneous combustion of substances and materials at the significance level o = 0.05: 

y = 25x — 50 227, (10) 
while the correlation coefficient is r= 1.0. 
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— linear model (11) characterizes with a sufficient degree of reliability the annual distribution of the 
number of fires from malfunction and violation of the rules of operation of furnace heating at the level of 
statistical significance a = 0.05: 

y = 1 175.00x — 2 360 588.00, (11) 
while the correlation coefficient is r= 1.0. 

— linear model (12) characterizes with a sufficient degree of reliability the annual distribution of the number of 
unidentified fires at the level of statistical significance a = 0.05: 

y =—72x + 146 890, (12) 
while the correlation coefficient is r= 1.0. 

— linear model (13) characterizes with a sufficient degree of reliability the annual distribution of the number of 
fires due to arson at the significance level a = 0.05: 

y = 873.00x — 1 738 465.00, (13) 
while the correlation coefficient is r= 1.0. 

Conclusions. From the above we can conclude: 

— based on the statistics from 2015 to 2020, it was found that the largest share of fires falls on arson and 
careless handling of fire; 

— it is proved that an effective tool for predicting fires is the use of linear regression analysis methods; 

— formalization of fire statistics using linear regression models makes it possible to structure and digitalize the 
available data arrays in terms of the criteria presented; 

— mathematical structuring of statistics from the point of view of the presented criteria allows the use of data 
arrays in an automated mode. 
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